1. Clone the GitHub repository: https://github.com/haotian-liu/LLaVA.git
2. Install the dependencies listed in requirements.txt
3. Run caption_gen.py to generate the captions
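
The three steps above can be sketched as a small Python driver. This is only an illustration, not the project's own tooling. It assumes that requirements.txt and caption_gen.py sit in the current working directory and that caption_gen.py needs no command-line arguments; none of this is stated in the steps, so adjust the paths and arguments to your setup. The helper names build_commands and run_all are hypothetical.

```python
import subprocess
import sys

REPO_URL = "https://github.com/haotian-liu/LLaVA.git"


def build_commands(requirements="requirements.txt", script="caption_gen.py"):
    """Return the three setup steps as argument lists, in order.

    The default paths are assumptions: both files are expected in the
    current working directory.
    """
    return [
        # Step 1: clone the LLaVA repository.
        ["git", "clone", REPO_URL],
        # Step 2: install dependencies with the current interpreter's pip.
        [sys.executable, "-m", "pip", "install", "-r", requirements],
        # Step 3: run the caption script (assumed to take no arguments).
        [sys.executable, script],
    ]


def run_all(dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for cmd in build_commands():
        print(" ".join(cmd))
        if not dry_run:
            # check=True stops at the first failing step.
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    run_all(dry_run=True)
```

With dry_run=True the script only prints the commands, which makes it easy to check them before running anything. Call run_all(dry_run=False) to execute the steps.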